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Abstract 

Background: The archeaon Sulfolobus solfataricus P2 encodes a thermoacidophilic cellulase which shows an 
extreme acid and thermal stability with a pH optimum at 1.8 and a temperature optimum at 80°C. This 
extraordinary enzyme could be useful for biotechnological exploitation but the expression and purification in 
expression hosts like E. coli is unsatisfactory due to the high aggregation tendency of the recombinant enzyme. The 
thermophilic cellulase CelA from Thermotoga moritimo belongs to the same glycoside hydrolase family (GH12) but 
has a neutral pH optimum. In contrast to SS01949 this enzyme is expressed partially soluble in £ coli. 

Results: We aimed to constructed a hybrid enzyme based on these two beta-endoglucanases which should 
successfully combine the advantageous properties of both cellulases, i.e. recombinant expression in £ coli, 
acidophily and thermophily. We constructed two hybrid proteins after bioinformatic analysis: both hybrids are 
expressed insoluble in £ coli, but one hybrid enzyme was successfully refolded from washed inclusion bodies. 

Conclusions: The refolded active chimeric enzyme shows a temperature optimum of approximately 85°C and a pH 
optimum of approximately pH 3 thus retaining the advantageous properties of the Sulfolobus parent enzyme. This 
study suggests that the targeted construction of chimeric enzymes is an alternative to point mutational 
engineering efforts as long as parent enzymes with the wanted properties are available. 



Background 

Cellulose is the most abundant biopolymer on earth and 
the main component of plant cell walls. Cellulose makes 
35-50% of the dry weight of plants [1] and represents an 
important alternative source of renewable energy [2]. 
Cellulose is a linear biopolymer of ft-l,4-glycosidic linked 
D-glucose molecules. Cellulose molecules usually consist 
of several thousand glucose units and can form larger 
crystalline structures via intermolecular hydrogen bonding. 
For the non-enzymatic hydrolysis of crystalline cellulose 
high temperatures combined with extreme pH conditions 
are required [3]. 

Cellulose can also be hydrolysed under milder condi- 
tions by special enzymes called cellulases. Cellulases 
catalyze the cleavage of ft-l,4-glycosidic bonds in the 
cellulose. Because of their mode of action and substrate 
specificity they can be classified into exoglucanases (EC 
3.2.1.91), endoglucanases (EC 3.2.1.4) and E-glucosidases 
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(EC 3.2.1.21) [4]. Exoglucanases split off cellobiose and 
endoglucanases hydrolyze 13-1,4-glycosidic bonds to 
decrease the length of the cellulose chains. E-glucosidases 
subsequently hydrolyze short oligosaccharides such as 
cellobiose to glucose [5]. Based on amino acid sequence 
similarities cellulases may be classified into different GH 
(glycoside hydrolases) families [4,6]. To date there are 131 
GH families; cellulases (E.C. 3.2.1.4) are found in fam- 
ilies 5-10, 12, 18, 19, 26, 44, 45, 48, 51, 61, 74 and 
124 (http://www.cazy.org/Glycoside-Hydrolases.html). 
Family 12 comprises endoglucanases from mesophilic 
and thermophilic archaea, bacteria and fungi. 

The demand for stable and highly active cellulases is 
high [7]. Cellulose as renewable source is an ideal low-cost 
starting material for the production of bioethanol that can 
be used as an alternative to fossil fuels. Cellulose in contrast 
to starch and other agricultural biopolymers has the 
advantage that it does not compete with the nutritional 
demands [8]. To make cellulose accessible for enzymatic 
degradation, the biomass is pre-treated with high tem- 
peratures and strong acids. For the next degradation step 
extreme thermoacidophilic enzymes would be preferable. 
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Most commercial enzymes have a pH optimum near 
neutrality and are derived from the mesophilic fungus 
Trichoderma reesei In contrast, the cellulase SS01949 from 
the hyperthermophilic archeaon Sulfolobus solfataricus 
represents a thermoacidophilic enzyme, which is opti- 
mally adapted to work under acidic conditions and 
high temperatures. 

The enzyme SS01949 (molecular mass 37 kDa) has a 
pH-optimum at approximately 1.8 as well as a temperature 
optimum at approximately 80°C [9]. To our knowledge only 
the protease thermopsin from Sulfolobus acidocaldarius 
[10] shows a similar pH and temperature activity profile. 

The protein consists of a N-terminal signal peptide, a 
Ser/Thr-rich region and a catalytic domain which shows 
significant homology to cellulases of glycoside hydrolase 
family 12 [9]. However, when SS01949 is expressed in E. 
colU it is mostly insoluble and the preparation of active en- 
zyme through solubilisation and refolding is cumbersome. 
The inclusion bodies formed by SS01949 are urea-stable 
and can only be solubilized with 6 M guanidinium chloride. 
Refolding is possible by rapid dilution in 0.8 M arginine. 
However during purification of the refolded enzyme by cat- 
ion exchange chromatography the protein precipitates on 
the column and can only be eluted with guanidinium chlor- 
ide containing buffers. We also attempted to refold and 
purify SS01949 at acidic, neutral and alkaline pH values. 
However the best results were obtained at neutral pH. The 
high aggregation tendency of SS01949 precludes its use for 
commercial applications or further engineering studies. 

In the present study, we report the construction of 
hybrid proteins of the cellulases SS01949 from Sulfolobus 
solfataricus and CelA from the thermophilic bacterium 
Thermotoga maritima by in vitro recombination. In vitro 
recombination allows the combination and optimization of 
specific properties of different proteins. Ideally, the resulting 
protein combines the advantageous properties of the parent 
proteins. Recombination plays a key role in natural evolu- 
tion of proteins and in the development of antibodies, 
synthases and proteases [11]. CelA also belongs to GH 
family 12 and is expressed in our hands in a partially soluble 
form in E. coli, but shows a neutral pH optimum [12,13]. 
We used the program SCHEMA developed by the group of 
F. Arnold, in order to choose suitable boundaries for 
chimera construction [14]. Usually, the program SCHEMA 
is used to construct recombination libraries. Here we show 
that a more targeted approach with two selected parent 
enzymes is feasible by obtaining a chimeric enzyme with 
advantageous properties. 

Results 

Sequence analysis, expression and purification of the 
hybrid proteins 

The cellulases SS01949 from Sulfolobus solfataricus 
and CelA from Thermotoga maritima show sequence 



similarities and belong to GH family 12 (Figure 1). 
SS01949 has a temperature and pH optimum of 
approximately 80°C and approximately pH 1.8 whereas 
CelA shows maximum activity at approximately 90-95°C 
and neutral pH. 

For construction of the hybrid proteins the program 
SCHEMA was used. SCHEMA predicts favorable sites for 
in vitro recombination based on structural information 
[14]. We adapted the python scripts of SCHEMA in order 
to calculate the disruption energies of a fusion SS01949- 
CelA and CelA-SS01949. This analysis yielded two local 
minima for the disruption energy at alignment position 175 
and 220. However these constructs would have consisted 
mainly of one parent protein with the N-terminal part of 
about 100 amino acids substituted by the other parent 
protein. We have therefore not considered these predic- 
tions further. We then calculated the disruption energies 
for hybrid proteins of the structure CelA-SS01949-CelA 
and SS01949-CelA-SS01949. The heat maps of the disrup- 
tion energy as a function of the both recombination sites is 
shown for both cases in Figure 2. In these triangular shaped 
heat plots the diagonal represents the case where the 
middle protein fragment has a length of 20 alignment 
positions. Likewise proteins corresponding to areas close 
to the left or upper border contain a very short N-terminal 
or C-terminal fragment, respectively. The further away 
from the three borders the more equally distributed are the 
lengths of the three protein fragments. 

For the fusion protein CelA-SS01949-CelA we found 
low disruption energy for a protein fusion with the first 
recombination site at alignment position 166 and the 
second at alignment position 391 (predicted disruption 
energy 16, red circle in Figure 2A). This leads to a fusion 
protein of 70 amino acids CelA followed by 163 amino 
acids SS01949 and 29 amino acids CelA (Figure 1). We 
considered this fusion as promising as the center fragment 
including most of the active center is derived from 
SS01949 whose enzymatic properties should be retained in 
the hybrid enzyme. In the modeled chimeric protein the 
substrate cleft for the cellulose chain is from the right to 
the left and is lined by a curved ft-sheet (Figure 3). Indeed 
the majority of the substrate cleft and the active site are 
derived from SS01949 and only the substrate cleft for the 
non-reducing end stems from CelA. A low disruption 
energy is also found for a chimeric protein with the recom- 
bination sites at positions 260 and 340 (red dotted circle). 
However in this hybrid enzyme the parent enzyme 
SS01949 would make only a minor contribution of about 
60 amino acids to the center fragment. We therefore did 
not consider this possible chimeric enzyme with low 
disruption energy further. 

For the chimeric protein with the structure SS01949- 
CelA-Ssol949 we found three possible recombination 
combinations (see red circles in Figure 2B). The left- most 
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CelA 
SS01949 



MIMNKL YIIIVPIIVII VVGVI G GAIYLHHQSPNVKTSSITVTTNETTTLMSITTNTV 5 8 



SS0194 9 PTTVTPTTSSIP--QL PTPVYLNN 1 FYLEVNMWNAKTWNGNYTMV 114 



SS01949 FNPL TRTLSVSFNLTQVNPL QWTN GYPE I YVGRKPWDTS YAGN IFPMRIGN 165 

ceiA Bj^^E^B 3^^B E53BS3EiBiSEiE ^^^E ^B^B3E3EES 140 

SS0194 9 MTPFMVSFYINLTKL DPSINFDIASDAWIVRPQIAFSPGTAP-GNGDIEIMVWLF 219 

SS01949 ^^^^A^^^3^^^P^Y^^^^^^F^^^^^^^^^^IA^R^r^^^^^ 277 



CelA 
SS01949 



CelA 
SS01949 



VKFDVRHFLDAAGKALSSSARVKDFEE 



^FTVWEIGTEFGSPETK-SAQFGWKFENFS 



VAYEPNLFIKALNNFASY- 



-NI TNYYLTDWEFGTEWGTMTSNGTAYFSWT I SNFY 330 



257 
334 



B 



CelA-SS01949-CelA 
SS0194 9-CelA-SS0194 9 



CelA-SS01949-CelA 
SS0194 9-CelA-SS0194 9 



CelA-SS01949-CelA 
SS0194 9-CelA-SS0194 9 



AIYLHHQSPNVKTSSITVTTNETTTLMSITTNTV 



PTTVTPTTSSIP — QL I YVT S S AS S PT PVYLNNS TVP S FYLEVNMWNAKTWNGNYTMV 



VGRKPWDTSYAGN IFPMRIGN 

-TRTLSVSFNLTQVNPLQWTNGYPE I YVGRKPWDTS YAGN IFPMRIGN 



CelA-SS0194 9-CelA MTPFMVSFYINLTKL DPSINFDIASDAWIVRPQIAFSPGTAP-GNGDIEIMVWLF 

SS0194 9-CelA-SS0194 9 MTPFMVSFYINLTKL DPSINFDIASDAWIVRPQIAFSPGTAP-GNGDIEIMV05 



CelA-SS01949-CelA 
SS0194 9-CelA-SS0194 9 



SQNLQPAGQQVGEWIPIYINHTLVNATFQVWKMKNVPWGGWEYIAFRPDGWKVTNGY 



CelA-SS01949-CelA 
SS0194 9-CelA-SS0194 9 



CelA-SS01949-CelA 
SS0194 9-CelA-SS0194 9 



VAYEPNLFIKALNNFASY 




NITNYYLTDWEFg 

N I TNYYLTDWEFGTEWGTMTSNGTAYFSWT I SNFY 



Figure 1 Alignment of the cellulases SS01949 from S. solfataricus and CelA from T. maritima. A: Alignment of SS01949 (black letters on 
white background) and CelA (white on black). The signal peptide of SS01949 is in italics and the Ser/Thr-rich region is underlined. The conserved catalytic 
glutamic residues and valine 63 are marked with asterisks. The four amino acids mutated at the valine 63 position are double-underlined. B: Sequence of 
the both fusion proteins. Black letters are amino acids derived from the parental sequence of SS01949; white letters show residues derived from CelA. 



red broken circle would represent the counterpart of the 
chimera we have chosen for chimera CelA-SS01949-CelA 
(Figure 3A). We did not consider this possible chimeric 
enzyme as the center fragment including most of the 
active site would be derived from CelA which does not 
show thermoacidophilic properties we aim for. For the 
same reason we also refrained from constructing the 
chimera with the recombination sites around the alignment 
positions 210 and 390 (right broken red circle). This 
chimeric protein would also have a dominant center 
fragment derived from CelA. For the fusion SS01949-CelA 
-SS01949 we therefore choose the recombination sites at 
positions 272 and 366 (red circle) yielding a fusion protein 
of 193 amino acids SS01949 followed by 76 amino 
acids CelA and 39 amino acids SS01949. The predicted 
SCHEMA disruption energy of this chimeric enzyme is 



somewhat higher with 28 but the parent SS01949 makes 
a more prominent contribution to the chimeric protein 
than with the latter two possibilities. The modeled struc- 
ture (Figure 3B) shows that the catalytic center with the 
two catalytic glutamate residues is derived from S SO 1949 
whereas the reducing end of the substrate binding cleft 
comes from CelA. 

Both hybrid proteins were constructed and expressed 
in E. coli strain BL21 AI. The proteins were produced in 
high yields in E. coli but aggregated in inclusion bodies. 
The recombinant protein CelA-SS01949-CelA consists 
of 262 amino acids and migrates in SDS gels with an 
apparent mass of approximately 29 kDa, which agrees with 
the theoretical mass (Figure 4). SS01949-CelA-SS01949 
consists of 308 amino acids and shows a molecular weight 
of approximately 36 kDa. The activity of the hybrid proteins 
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Figure 2 Calculation of disruption energies for chimeric enzymes. The disruption energies were calculated in dependence of the two 
recombination sites for a chimeric protein with the structure protein A - protein B - protein A. The first recombination site is on the abscissa 
the second on the ordinate; the calculated disruption energies are visualized as a heat map. A: energies for the chimera CelA-SS01949-CelA. 
B: energies for the chimera SS01949-CelA-SS01949. The triangles and boxes along the axes represent beta-sheets and alpha-helices, respectively. 
Red circles refer to the recombination sites selected. Abcissa and ordinate refer to alignment position. The conserved domain of GH12 starts at 
about alignment position 100. 



was verified by CMC-plates. For this purpose the solubilised 
inclusion bodies were spotted directly onto carboxymethyl- 
cellulose plates. After incubating the plates overnight and 
staining with Congo Red, only CelA-SS01949-CelA shows 
activity. SS01949-CelA-SS01949 was inactive in this assay. 
Noteworthy, both parent enzymes expressed in £. coli 
also shows activity in this assay. We therefore concluded 



that the enzymatic activity of the fusion SS01949-CelA 
-SS01949 is severely compromised and continued working 
only with the hybrid CelA-SS01949-CelA. 

Because the hybrid protein was insoluble we applied an 
inclusion body washing step with increasing concentration 
of urea (Figure 4A) where a majority of the E. coli host 
proteins could be removed. Next we refolded the fusion 




Figure 3 Model of chimeric enzyme CelA|SS01949|CelA. A model structure of the hybrid CelA-SS01949-CelA (A) and of the hybrid SS01949- 
CelA-SS01949 (B) using the structure of CelA (3AMH) as template. The sequence alignment was performed with HHpred [15] and the structure 
calculation was performed with Modeller [16]. The part derived from the parent enzyme CelA is in purple; whereas the SS01949 part is shown in 
cyan. The catalytic glutamate residues are shown in orange sticks. It can be clearly seen that the chimeric enzymes consists of two distinct halves 
derived from both parent proteins. The structures are visualised with Chimera [17]. 
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Figure 4 Purification of recombinant hybrid protein CelA-SS01949-CelA. (A) Protein fractions were analysed by SDS-PAGE followed by 
Coomassie Blue staining: whole cell extract (lane WCE), crude extract (lane CE), Triton X extract (lane T), Urea extract at 5, 6 and 8 M urea, 
respectively (lanes 5MU, 6MU, 8MU). (B) Pooled and dialyzed protein fraction after refolding and hydrophobic interaction chromatography. 



protein. Previously we have screened the refolding of 
recombinant SS01949 with 96 buffers and found that 
0.8 M arginine containing refolding buffer yield active 
enzyme (data not shown). Refolding of the fusion protein 
CelA-SS01949-CelA with an arginine containing buffer 
was successful and the soluble and active enzyme was 
purified and concentrated by hydrophobic interaction 
chromatography with a propyl-column (Figure 4B). 

Enzymatic characterization of CelA-SS01949-CelA 

For detailed characterization of cellulase activity a FRET 
(fluorescence resonance energy transfer) -based assay was 
used. The substrate consists of 6 6-1,4-linked glucose 
units and carries a fluorophore (EDANS) at the reducing 
end and a chromophore at the non-reducing end [18]. 
Incubation of CelA-SS01949-CelA with the fluorescent 
cellohexaoside leads to an increase in fluorescence at 490 
nm, which indicates the cleavage of the substrate. The 
measured fluorescence is proportional to the number of 
hydrolysed substrate molecules. The FRET- assay is sensitive 
and the used substrate is even stable under the extreme pH 
and temperature conditions. 

Measurements of initial rates at various substrate 
concentrations yielded a K m value of 1.7 uM and the 
maximal velocity of 0.8 umol-min^mg" 1 at 80°C and pH 3. 
This value is close to the specific activity of SS01949 of 1.0 
umole-min^mg" 1 [9] but much lower than the specific 
activity of CelA. The maximal velocity of the hybrid protein 
translates to a k cat of 0.39 s" 1 under the assumption that the 
enzyme preparation is wholly active (Figure 5, Table 1). 
The hybrid protein did not hydrolyse the substrate 



j9-nitrophenyl-13-D-cellobioside. This property of the 
enzyme has been taken over from SS01949, which 
also does not degrade ^-nitrophenyl-ft-D-cellobioside [9]. 
Possibly the reducing end substrate binding cleft derived 
from SS01949 requires a sugar moiety at the +1 subsite. 

pH and temperature dependence of CelA-SS01949-CelA 

The hybrid protein CelA-SS01949-CelA shows like 
S SO 1949 an optimal activity at acidic pH and high 




0.0 £ ■ , ■ , ■ , ■ , ■ , 
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cellohexaoside [|iM] 

Figure 5 Michaelis-Menten kinetics of the chimeric enzyme 
CelA-SS01949-CelA at pH 3 and 80°C The initial rates are shown 
as a function of fluorescent cellohexaoside concentration. The points 
are measured rates and the line is the least-squared minimized fit of 
the data to the Michealis-Menten equation with k cat = 0.39 ± 0.01 s" 1 
and K M = 1.74 + 0.16 uM. 
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Table 1 pH and temperature optimum as well as kinetic parameters of SS01949 [9], CelA and the hybrid protein (this 
work) 





Optimal pH 


Optimal temperature 


kcat IS" 1 ] 


K M [uM] 


kcat/K M ts" 1 MM" 1 ] 




80°C1 uM cellohexaoside 


1 uM cellohexaoside 


80°C 


80°C 


80°C 


SS01949 


1.8 


80°C 


0.5 


2.0 


0.25 


CelA 


4.5 


95°C 


19 + 0.9 


5.5 ± 0.8 


3.5 


CelA-SS01949-CelA 


3 


85°C 


0.39 ±0.01 


1.74 ±0.15 


0.23 



temperatures. The activity measurements with the 
FRET-substrate at different pH values reveal a pH profile 
with an optimum at pH 3 (Figure 6). The parents 
SS01949 and CelA have a pH optimum at 1.8 for 
SS01949 and pH 4.5 for CelA. Endpoint measurements 
for the hybrid protein revealed a temperature optimum 
at 85°C. The temperature dependence of the activity 




T[K] 

Figure 6 Activity profile of CelA-SS01949-CelA at different pH 

values and temperatures. (A) The cellulase activity of CelA-SS01949 

-CelA was assayed at different pH values. The enzyme (120 ng) was 

incubated with 1 uM fluorescent cellohexaoside in 100 mM phosphate 

buffer. (B) Temperature optimum of the chimeric enzyme in 100 mM 

sodium phosphate buffer (pH 3): After correcting the temperature 

dependent fluorescence intensity the activity of the hybrid protein was 

calculated (•). At temperatures above 80°C the enzymatic hydrolysis as 

determined with end-point determination after 60 s (o). The inset 

shows the Arrhenius analysis with reciprocal abscissa and logarithmic 

ordinate. The enzyme works best at 85°C and pH 3. 
\ ) 



allows calculating the activation energy of the enzymatic 
reaction which is approximately 76 kj/mol (see inset of 
Figure 6B). The activation energy for the parent enzyme 
SS01949 is 59 kj/mol [9]. 

Discussion 

The hyperthermophilic cellulase SS01949 is optimally 
adapted to work under acidic conditions and high 
temperatures. Because of these unique properties, SS01949 
is a good starting point to develop thermoacidophilic cellu- 
lases for bio technological purposes. SS01949 is expressed 
insoluble in E. coli and has a great tendency to aggregate 
even after refolding therefore large scale production of this 
protein is difficult to achieve. The thermophilic enzyme 
CelA in contrast, is expressed in partly soluble form in E. 
coli, but shows the highest enzymatic activity at around pH 
4.5 for the fluorescent cellohexaoside and pH 6.5 for 
j?-nitrophenyl-:R-D-cellobioside (data not shown). 

The aim of this study was to develop a hybrid protein 
in which the three positive characteristics thermophily, 
acidophily and improved solubility can be combined. 

Before constructing the hybrid proteins we used the 
software SCHEMA to assess putative recombination points 
for both parent enzymes. The calculations suggested two 
chimeric proteins, which were then further pursued. 
CelA-SS01949-CelA was N- and C-terminally flanked 
by CelA with the biggest part of the catalytic region 
of SS01949. The second protein SS01949-CelA-SS01949 
consists mainly of SS01949. Only a part of substrate bind- 
ing cleft is replaced by CelA. Both hybrid proteins were 
overexpressed, but in an insoluble form. Furthermore the 
enzyme SS01949-CelA-SS01949 proved to be inactive 
after refolding attempts. Possibly, in this chimeric protein 
critical interactions necessary for catalysis have been 
disturbed. Noteworthy the disruption energy calculated by 
SCHEMA was also nearly twofold as high as for the other 
chimeric enzyme. 

We were more successful with the hybrid protein 
CelA-SS01949-CelA. The protein also aggregated in 
inclusion bodies but could be refolded into active protein 
and was far less prone to aggregation than the parent 
SS01949. The solubility of the hybrid protein is consider- 
ably improved in comparison with SS01949. The hybrid 
protein can be completely purified from inclusion bodies 
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using 8 M urea. For purification of SS01949 stronger 
denaturants such as guanidinium chloride was needed. 

To compare the enzymatic activity of CelA-SS01949- 
CelA with SS01949 and CelA the enzymatic charac- 
terization was done with a FRET substrate. The hybrid 
protein showed a roughly bell-shaped pH profile, which 
is caused by the ionization states of the two catalytic 
acidic residues. The pH optimum was approximately at 
pH 3 which is almost exactly in the middle of the two 
pH optima of SS01949 (pH 1.8) and CelA (pH 4.5). By 
endpoint measurement a temperature optimum of 85°C 
for CelA-SS01949-CelA could be determined. The 
specific activity of the hybrid is lower than the specific 
activities of the parent proteins. At pH 3 and 80°C 
the specific activity of CelA-SS01949-CelA was 0.55 
(imole-min^mg 1 with the non-saturating substrate 
concentration of 1 \iM cellohexaoside. The turnover 
number k cat under the assumption of a fully active 
enzyme preparation and saturating concentrations of 
the FRET-substrate is 0.39 s" 1 . This corresponds to a 
Kcat/K M value of 2.3 x 10 5 s^M" 1 . 

The molecular basis for the extremely low pH 
optimum of SS01949 is currently unknown. There have 
been a few attempts to influence the pH optimum of 
endoglucanases by substituting selected amino acid 
residues in the neighborhood of the catalytic glutamate 
residues. This approach has been very successful in the 
case of the Xylanase C from Aspergillus kawachii. This 
enzyme belongs to glycoside hydrolase family 11 which 
is structurally similar to the glycoside hydrolase family 
12. In this enzyme the exchange of an aspartate residue 
to an asparagine residue raised the pH optimum from 
2.0 to 5.0 [19] albeit at the expense of a reduction of the 
specific activity to only 15%. Similar experiments in the 
direction to lower the pH optimum of xylanases were 
much less successful. In the case of the Xylanase A from 
Bacillus circulans the exchange of an asparagine residue 
to an aspartate residue lowered the pH optimum from 
5.7 to 4.6 and increased the specific activity to about 
120% [20]. A similar extent of pH optimum change was 
also seen for the Xylanase I from Streptomyces sp. Here the 
pH optimum dropped from 6.0 to 5.0 when asparagine was 
exchanged to aspartate at the homologous position. The 
mutation also resulted in a decrease of the specific activity 
to about 50%. These studies indicate that this residue in the 
neighborhood of the catalytic center is important for the 
pH optimum of the respective enzymes. In an attempt to 
lower the pH optimum of the Thermotoga maritima 
enzyme CelA we mutated valine 63, the homologous 
residue. To our disappointment the change V63T using the 
threonine residue of the Sulfolobus enzyme resulted in a 
mutated enzyme with an unchanged pH optimum but a 
largely decreased specific activity (- 88%). A second 
attempt exchanging four amino acids at this position 



(see Figure 1) resulted in a drop of the pH optimum 
to 5.5 which is one pH unit lower than the optimal pH with 
the substrate ^-Nitrophenyl-beta-D-cellobioside (data not 
shown). Our failure to achieve a substantial pH change by 
performing mutations at a selected position known to influ- 
ence the pH optimum in related enzymes motivated us to 
construct the chimeric enzymes presented in this work. 

Conclusions 

Of the two hybrid enzymes one chimera possess advanta- 
geous properties: It still exhibits a low pH optimum, a high 
temperature optimum and a high specific activity and most 
importantly it can be easily produced and purified from 
recombinant E. coli. This example suggests that the 
targeted construction of chimeric enzymes is a viable 
alternative to point mutational studies provided that parent 
enzymes with the wanted properties are available. 

Methods 

Prediction of chimeric proteins by SCHEMA 

The SCHEMA prediction requires a sequence alignment 
of protein sequences including one structure. The structure 
is required to calculate the contacts between the residues in 
the multiple sequence alignment which are then later used 
to calculate the disruption energies of chimeric enzymes. 
For the sequence alignment calculated by T-Coffee we used 
18 bacterial and archaeal endoglucanase of GH family 12 
(ZP 05098279.1, CAB06783.1, YP 001244857.1, YP_00235 
2530.1, ZP_04880023.1, CBH31132.1, NP_578583.1, NP_34 
2800.1, YP 002842958.1, CAB06782.1, YP 001541434.1, 
YP 001540672.1, YP_921079.1, YP_001 541794.1, YP_00154 
0299.1, NP_343873.1, YP_256451.1, YP_002836552.1) 
which are related to the target enzymes SS01949 
(AAK42142.1) and CelA (CAA93273.1), respectively. The 
inclusion of the related proteins improved the alignment of 
the less conserved stretches. As structural template we used 
the endoglucanase Cell 2 A from Rhodothermus marinus 
(1H0B) which displayed high sequence similarity to 
SS01949 [9]. In contrast to the normal use of SCHEMA 
we were not interested to design a library of recombination 
fragments we rather used SCHEMA for a more targeted 
approach and calculated the disruption energies for the 
generation of a chimeric protein consisting of SS01949 and 
CelA. We took into account two possibilities: a simple 
fusion of SS01949 and CelA with a single recombination 
point and a hybrid protein consisting of three frag- 
ments and two recombination points. The python scripts 
of SCHEMA were adjusted accordingly and calculated the 
disruption energies for all possible combinations. The cal- 
culated disruption energies do not correspond to a physico- 
chemical energy but rather reflects the number of residue 
contacts which are modified during hybrid construction 
since one of amino acids which are close in the structure 
changed their identity. Thus the lower the disruption 
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energy the more native residue contacts could remain 
preserved in the fusion protein. 

Cloning and expression of the hybrid proteins 

For amplification of parts of the genes ssol949 and celA 
the plasmids pET28c-CelA and pET28c-SS01949Nhis 
have been used. The N-terminal part of the hybrid protein 
CelA-SS01949-CelA was amplified by PCR with primers 
celA_BsaI.for (for sequences refer to Table 2) and 
l_1949_celA.rev, the middle part with 2_CelA_1949.for 
and 3_CelA_1949.rev and the C-terminal part with 
4_1949_CelA.for and CelA_HindIII.rev. The resulting 
gene does not encode a purification tag. 

SS01949_CelA_SS01949 was amplified with primers 
T7-Promotor and 5_CelA_1949.rev, 6_1949_CelA.for and 
7_1949_CelA.rev, 8_CelA_1949.for and T7-Terminator. 
The resulting gene does no longer encode the signal 
peptide of SS01949 but instead encodes an N-terminal 
hexahistidine peptide and the thrombin recognition 
sequence. 

The generated PCR-fragments contain overlapping ends 
for celA and ssol949. In the next step two fragments of 
each hybrid were amplified together and in the last step the 
remaining fragment was added and amplified. We created 
two hybrid genes, which were ligated into the vector 
pJET/1.2 blunt (Fermentas, St. Leon-Rot, Germany). 
The hybrid gene celA-ssol949-celA contains cleavage 
sites Bsal (which cuts outside of its recognition site 
and generates an Ncol compatible end in this construct) 
and Hindlll and ssol949-celA-ssol949 contains Nhel and 
Xhol upstream and downstream of the gene. The PCR 
products were sequenced and cloned into pET-28c 
(Novagen, Madison, WI, USA). 

The expression plasmids were used to transform 
Escherichia coli BL21 AI cells (Invitrogen). For expression, 
cells were grown overnight in 20 ml of Luria-Bertani 



Table 2 Oligonucleotides used in this study 



Name 


Sequence (5' - 3') 


celA_Bsal.for 


AGGTCTCGCATGGTGGTACTGATGACAAAACCGGGAA 


celAJHindlll.rev 


AAAGCmCATOTCTCACCTCCAGATCAATAGAG 


1_1949_celA.rev 


CAGGG^CTGCCCACATAAAACTCGGGATAAC 


2_CelA_1949.for 


GTOTCCCGAGI I I I ATGTGGGCAGAAAACCCTG 


3_CelA_1949.rev 


CTOCAAACTCG GTOCGAACTCCCAATCCG™ 


4_1949_CelA.for 


TAACGGATOGGAGTOGGAACCGAGmGGAAG 


5_CelA_1949.rev 


TOAAATAGAACCAGACCATOTCTCAATGTCCC 


6_1949_CelA.for 


GGGACATOAGATAATGGTCTGGTOTAmCAA 


7_1949_CelA.rev 


TAATAGmGTAATGTOGCAGAACTCGAAAGAG 


8_CelA_1949.for 


CTCmCGAGTOTGCTAACATOCAAACTA™ 


T7-Promotor 


TAATACGACTCACTATAGGG 


T7-Terminator 


GCTAGTOTOCTCAGCG 



medium with 50 ftg/m\ kanamycin at 37°C. After 
inoculation of 2 litres of Luria-Bertani medium, the 
incubation was continued to an A500 

of 0.6. 1 mM 

Isopropyl /?-D-thiogalactoside and 0.2% arabinose were then 
added and the culture was fermented for further 12 h at 
room temperature. Cells were harvested by centrifugation. 

Purification and refolding of hybrid proteins 

The cell pellet was resuspended in 20 ml in buffer A 
(100 mM Tris/HCl, pH 7). Cells were disrupted by a 
sonification and centrifuged for 20 min and 20 000 g. 
The cell pellet which contains the recombinant enzyme 
was washed with 20 ml Triton X-100 to remove mem- 
brane proteins. The remaining inclusion body was 
washed with 5 M and 6 M urea and solubilized in 10 ml 
buffer B (100 mM Tris/HCl pH 8, 8 M urea). After 
10 fold dilution in buffer B the protein was refolded in 
20 fold volume of refolding buffer (50 mM MES, pH 6, 
800 mM arginine) for 2 hours at 8°C. Refolding was 
done in a total volume of 50 ml. The refolded protein 
was then purified and concentrated by hydrophobic 
interaction chromatography. For this purpose the refolded 
protein solution was first slowly further diluted 4 fold with 
ddH 2 0 and then brought to 1.5 M ammonium sulfate. This 
solution was loaded onto a 0.3 ml EMD-Propyl (Merck) 
column (MoBiTec GmbH, Gottingen) and step-eluted 
with 0.5 ml of 100 mM Tris/HCl pH 8 in 0.5 ml. 
The active fractions (tested with activity plates) were 
dialysed against the storage buffer (20 mM sodium 
phosphate, pH 7.0, 50% (v/v) glycerol, 1 mM DTT, 0.1 mM 
EDTA, and 150 mM NaCl) and stored at -20°C. The pro- 
tein concentration was determined spectrophotometrically 
and calculated according to Ehresmann et al. [21]. 
Total yield of a 2 litre culture of CelA-SS01949-CelA 
was approximately 3 mg of purified protein. 

Activity plates: hydrolysis of carboxymethylcellulose 
(CMC) 

Activity plates enabled qualitative determination of 
cellulase activity of the chimeric proteins. 2.1 g Gelrite 
(Sigma) have been autoclaved in 240 ml ddH 2 0 and 30 
ml of 0.5 M sodium phosphate pH 3, 30 ml of 2% CMC 
solution and 3 ml of 1 M MgCl 2 were added and poured 
into Petri dishes. After solidification 20 \A of protein extract 
were applied and allowed to dry. The plate was incubated 
overnight at 75°C. Then, the plate was washed with 
100 mM sodium phosphate pH 6 and stained with 
Congo Red for 30 min. Next, the plate was destained 
with 1 M NaCl. An active cellulase yields a white spot on 
a red background. 

Fluorescent activity assay 

The quantification of enzyme activity was done by 
FRET (fluorescence resonance energy transfer) based 
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assay. The bifunctionalized cellohexaoside sodium A/-{2-A/- 
[(S-(4-deoxy-4-dimethylamino-phenylazophenylthioureido- 
^-D-glucopyranosyl)-(1^4)-^-D-glucopyranosyl-(1^4)- 
^-D-glucopyranosyl-(1^4)-y5-D-glucopyranosyl-(1^4)-^-D- 
glucopyranosyl- ( 1 — >4) -/^D -glucopyranosyl) -2-thioacetyl] 
aminoethyl}-l-naphthylamine-5-sulphonate was offered 
as substrate [18]. The reaction was followed by monitoring 
fluorescence, which increases in the course of the 
enzymatic reaction. The measurement was performed 
in 0.1 M sodium phosphate buffer at various pH and 
temperatures and 1 \iM substrate on a PerkinElmer 
LS50B spectrofluorometer equipped with a thermostatically 
controlled cuvette holder (80°C). Excitation was at 340 nm 
and emission was observed at 490 nm. The fluorescent 
substrate is very stable under these conditions and 
has a half-live of several hours [9]. Initial rate con- 
stants were determined at different substrate concen- 
trations in the presence of 120 ng hybrid protein. The 
Michaelis-Menten constant K m , k cat and the maximal 
velocity v max were calculated by nonlinear regression to 
the Michealis-Menten equation. 
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